Process models of interrelated speech intentions from online health-related conversations

Artif Intell Med. 2018 Sep:91:23-38. doi: 10.1016/j.artmed.2018.06.007. Epub 2018 Jul 18.

Abstract

Being related to the adoption of new beliefs, attitudes and, ultimately, behaviors, analyzing online communication is of utmost importance for medicine. Multiple health care, academic communities, such as information seeking and dissemination and persuasive technologies, acknowledge this need. However, in order to obtain understanding, a relevant way to model online communication for the study of behavior is required. In this paper, we propose an automatic method to reveal process models of interrelated speech intentions from conversations. Specifically, a domain-independent taxonomy of speech intentions is adopted, an annotated corpus of Reddit conversations is released, supervised classifiers for speech intention prediction from utterances are trained and assessed using 10-fold cross validation (multi-class, one-versus-all and multi-label setups) and an approach to transform conversations into well-defined, representative logs of verbal behavior, needed by process mining techniques, is designed. The experimental results show that: (1) the automatic classification of intentions is feasible (with Kappa scores varying between 0.52 and 1); (2) predicting pairs of intentions, also known as adjacency pairs, or including more utterances from even other heterogeneous corpora can improve the predictions of some classes; and (3) the classifiers in the current state are robust to be used on other corpora, although the results are poorer and suggest that the input corpus may not sufficiently capture varied ways of expressing certain speech intentions. The extracted process models of interrelated speech intentions open new views on grasping the formation of beliefs and behavioral intentions in and from speech, but in-depth evaluation of these conversational models is further required.

Keywords: Conversation analysis; Conversational processes; Intention mining; Machine learning; Natural language processing; Process mining; Speech acts; Speech intentions; Text mining.

MeSH terms

  • Communication
  • Consumer Health Information / methods*
  • Data Mining / methods*
  • Humans
  • Information Seeking Behavior*
  • Intention
  • Internet*
  • Machine Learning
  • Natural Language Processing
  • Speech*