X-CHAR: A Concept-based Explainable Complex Human Activity Recognition Model

Jeya Vikranth Jeyakumar; Ankur Sarker; Luis Antonio Garcia; Mani Srivastava

doi:10.1145/3580804

X-CHAR: A Concept-based Explainable Complex Human Activity Recognition Model

Proc ACM Interact Mob Wearable Ubiquitous Technol. 2023 Mar;7(1):17. doi: 10.1145/3580804. Epub 2023 Mar 28.

Authors

Jeya Vikranth Jeyakumar¹, Ankur Sarker¹, Luis Antonio Garcia², Mani Srivastava¹

Affiliations

¹ University of California Los Angeles, USA.
² University of Southern California, Information Sciences Institute, USA.

Abstract

End-to-end deep learning models are increasingly applied to safety-critical human activity recognition (HAR) applications, e.g., healthcare monitoring and smart home control, to reduce developer burden and increase the performance and robustness of prediction models. However, integrating HAR models in safety-critical applications requires trust, and recent approaches have aimed to balance the performance of deep learning models with explainable decision-making for complex activity recognition. Prior works have exploited the compositionality of complex HAR (i.e., higher-level activities composed of lower-level activities) to form models with symbolic interfaces, such as concept-bottleneck architectures, that facilitate inherently interpretable models. However, feature engineering for symbolic concepts-as well as the relationship between the concepts-requires precise annotation of lower-level activities by domain experts, usually with fixed time windows, all of which induce a heavy and error-prone workload on the domain expert. In this paper, we introduce X-CHAR , an eXplainable Complex Human Activity Recognition model that doesn't require precise annotation of low-level activities, offers explanations in the form of human-understandable, high-level concepts, while maintaining the robust performance of end-to-end deep learning models for time series data. X-CHAR learns to model complex activity recognition in the form of a sequence of concepts. For each classification, X-CHAR outputs a sequence of concepts and a counterfactual example as the explanation. We show that the sequence information of the concepts can be modeled using Connectionist Temporal Classification (CTC) loss without having accurate start and end times of low-level annotations in the training dataset-significantly reducing developer burden. We evaluate our model on several complex activity datasets and demonstrate that our model offers explanations without compromising the prediction accuracy in comparison to baseline models. Finally, we conducted a mechanical Turk study to show that the explanations provided by our model are more understandable than the explanations from existing methods for complex activity recognition.

Keywords: Activity recognition; Explainable AI; Interpretability; Neural networks.

Grants and funding

P41 EB028242/EB/NIBIB NIH HHS/United States