A Dataset of 10,000 Situations for Research in Computational Social Sciences Psychology and the Humanities

Sci Data. 2023 Jul 29;10(1):505. doi: 10.1038/s41597-023-02406-6.

Abstract

It has been realized that situational dimensions, as represented by human beings, are crucial for understanding human behavior. The Riverside Situational Q (RSQ) is a tool that measures the psychological properties of situations. However, the RSQ-4 includes only 90 items and may have limited use for researchers interested in measuring situational dimensions using a computational approach. Here we present a corpus of 10,000 artificially generated situations corresponding mostly with the RSQ-4. The dataset was generated using GPT, the state-of-the-art large language model. The dataset validity is established through inter-judge reliability, and four experiments on large datasets support its quality. The dataset and the code used for generating 100 situational dimensions may be useful for researchers interested in measuring situational dimensions in textual data.

Publication types

  • Dataset
  • Research Support, N.I.H., Extramural
  • Research Support, U.S. Gov't, P.H.S.