Truth or lie: Exploring the language of deception

PLoS One. 2023 Feb 2;18(2):e0281179. doi: 10.1371/journal.pone.0281179. eCollection 2023.

Abstract

Lying appears in everyday oral and written communication. As a consequence, detecting it on the basis of linguistic analysis is particularly important. Our study aimed to verify whether the differences between true and false statements in terms of complexity and sentiment that were reported in previous studies can be confirmed using tools dedicated to measuring those factors. Further, we investigated whether linguistic features that differentiate true and false utterances in English (namely utterance length, concreteness, and particular parts of speech) are also present in the Polish language. We analyzed nearly 1,500 true and false statements, half of which were transcripts while the other half were written statements. Our results show that false statements are less complex in terms of vocabulary, are more concise and concrete, and have more positive words and fewer negative words. We found no significant differences between spoken and written lies. Using these data, we built classifiers to automatically distinguish true from false utterances, achieving an accuracy of 60%. Our results add to previous conclusions regarding linguistic indicators of deception.
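
As a rough illustration of the kind of features the abstract mentions (utterance length, vocabulary complexity, and counts of positive and negative words), the following minimal Python sketch computes them for a single statement. It is not the authors' pipeline: the abstract does not name the tools used, so the word lists `POSITIVE` and `NEGATIVE`, the helper names `tokenize` and `features`, and the use of type-token ratio as a stand-in for vocabulary complexity are all assumptions made for illustration.

```python
import re

# Hypothetical toy lexicons, for illustration only; the study's actual
# sentiment resources are not named in the abstract.
POSITIVE = {"good", "great", "happy", "love"}
NEGATIVE = {"bad", "sad", "hate", "awful"}


def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"\w+", text.lower())


def features(text):
    """Return simple surface features for one statement.

    type_token_ratio is an assumed proxy for vocabulary complexity:
    the number of distinct tokens divided by the total token count.
    """
    tokens = tokenize(text)
    n = len(tokens)
    return {
        "length": n,
        "type_token_ratio": len(set(tokens)) / n if n else 0.0,
        "positive": sum(t in POSITIVE for t in tokens),
        "negative": sum(t in NEGATIVE for t in tokens),
    }


if __name__ == "__main__":
    print(features("I love this great great day"))
```

Feature dictionaries of this kind could then be passed to any standard classifier; the abstract reports only the resulting accuracy (60%), not the model type.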

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Deception
  • Language*
  • Linguistics
  • Speech
  • Vocabulary*

Grants and funding

Research was funded by The National Science Centre Poland (https://www.ncn.gov.pl/en), grant number UMO-2017/26/D/HS6/00212 awarded to JS-W. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.