Speech-Based Activity Recognition for Trauma Resuscitation

Jalal Abdulbaqi; Yue Gu; Zhichao Xu; Chenyang Gao; Ivan Marsic; Randall S Burd

doi:10.1109/ichi48887.2020.9374372

Speech-Based Activity Recognition for Trauma Resuscitation

IEEE Int Conf Healthc Inform. 2020 Nov-Dec:2020:10.1109/ichi48887.2020.9374372. doi: 10.1109/ichi48887.2020.9374372. Epub 2021 Mar 12.

Authors

Jalal Abdulbaqi¹, Yue Gu¹, Zhichao Xu¹, Chenyang Gao¹, Ivan Marsic¹, Randall S Burd²

Affiliations

¹ Department of Electrical and Computer Engineering Rutgers, The State University of New Jersey Piscataway, NJ, USA.
² Trauma and Burn Surgery Children's National Medical Center Washington, DC, USA.

Abstract

We present a speech-based approach to recognize team activities in the context of trauma resuscitation. We first analyzed the audio recordings of trauma resuscitations in terms of activity frequency, noise-level, and activity-related keyword frequency to determine the dataset characteristics. We next evaluated different audio-preprocessing parameters (spectral feature types and audio channels) to find the optimal configuration. We then introduced a novel neural network to recognize the trauma activities using a modified VGG network that extracts features from the audio input. The output of the modified VGG network is combined with the output of a network that takes keyword text as input, and the combination is used to generate activity labels. We compared our system with several baselines and performed a detailed analysis of the performance results for specific activities. Our results show that our proposed architecture that uses Mel-spectrum spectral coefficients features with a stereo channel and activity-specific frequent keywords achieve the highest accuracy and average F1-score.

Keywords: activity recognition; audio classification; keyword; speech processing; trauma resuscitation.

Grants and funding

R01 LM011834/LM/NLM NIH HHS/United States