Analysis of spontaneous speech in Parkinson's disease by natural language processing

Parkinsonism Relat Disord. 2023 Aug:113:105411. doi: 10.1016/j.parkreldis.2023.105411. Epub 2023 Apr 26.

Abstract

Introduction: Patients with Parkinson's disease (PD) encounter a variety of speech-related problems, including dysarthria and language disorders. To elucidate the pathophysiological mechanisms for linguistic alteration in PD, we compared the utterance of patients and that of healthy controls (HC) using automated morphological analysis tools.

Methods: We enrolled 53 PD patients with normal cognitive function and 53 HC, and assessed their spontaneous speech using natural language processing. Machine learning algorithms were used to identify the characteristics of spontaneous conversation in each group. Thirty-seven features focused on part-of-speech and syntactic complexity were used in this analysis. A support-vector machine (SVM) model was trained with ten-fold cross-validation.

Results: PD patients were found to speak less morphemes on one sentence than the HC group. Compared to HC, the speech of PD patients had a higher rate of verbs, case particles (dispersion), and verb utterances, and a lower rate of common noun utterances, proper noun utterances, and filler utterances. Using these conversational changes, the respective discrimination rates for PD or HC were more than 80%.

Conclusions: Our results demonstrate the potential of natural language processing for linguistic analysis and diagnosis of PD.

Keywords: Linguistic analysis; Natural language processing; Parkinson's disease; Part-of-speech; Spontaneous speech.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Communication
  • Humans
  • Language
  • Natural Language Processing
  • Parkinson Disease*
  • Speech* / physiology