Machine learning item selection for short scale construction: A proof-of-concept using the SIMS

Graziella Orrù; Barbara De Marchi; Giuseppe Sartori; Angelo Gemignani; Cristina Scarpazza; Merylin Monaro; Cristina Mazza; Paolo Roma

doi:10.1080/13854046.2022.2114548

Clin Neuropsychol. 2023 Oct;37(7):1371-1388. doi: 10.1080/13854046.2022.2114548. Epub 2022 Aug 26.

Authors

Graziella Orrù¹, Barbara De Marchi², Giuseppe Sartori³, Angelo Gemignani¹, Cristina Scarpazza³, Merylin Monaro³, Cristina Mazza⁴, Paolo Roma⁵

Affiliations

¹ Department of Surgical, Medical, Molecular & Critical Area Pathology, University of Pisa, Pisa, Italy.
² Department of Neuroscience and Rehabilitation, University of Ferrara, Ferrara, Italy.
³ Department of General Psychology, University of Padua, Padua, Italy.
⁴ Department of Neuroscience, Imaging and Clinical Sciences, G. d'Annunzio University of Chieti-Pescara, Chieti, Italy.
⁵ Department of Human Neuroscience, Sapienza University of Rome, Rome, Italy.

PMID: 36017966

Abstract

Objective: This proof-of-concept paper provides evidence to support machine learning (ML) as a valid alternative to traditional psychometric techniques in the development of short forms of longer parent psychological tests. ML comprises a variety of feature selection techniques that can be efficiently applied to identify the set of items that best replicates the characteristics of the original test.

Methods: In the present study, we integrated a dataset of 329 participants from published and unpublished datasets used in previous research on the Structured Inventory of Malingered Symptomatology (SIMS) to develop a short version of the scale. The SIMS is a multi-axial self-report questionnaire and a highly efficient psychometric measure of symptom validity, which is frequently applied in forensic settings.

Results: State-of-the-art ML item selection techniques achieved a 72% reduction in length while capturing 92% of the variance of the original SIMS. The new SIMS short form now consists of 21 items.

Conclusions: The results suggest that the proposed ML-based item selection technique represents a promising alternative to standard psychometric correlation-based methods (i.e. item selection, item response theory), especially when selection techniques (e.g. wrapper) are employed that evaluate global, rather than local, item value.
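The distinction between global and local item value can be illustrated with a minimal sketch of wrapper-style selection. It is not the authors' pipeline: the abstract does not specify the algorithm, so greedy forward selection is swapped in here as one simple wrapper strategy, the short-form score is an unweighted item sum, and the data are simulated. The names `pearson_r`, `forward_select` and `simulate` are hypothetical helpers written for this illustration. At each step the candidate item is judged by how much the whole subset's score tracks the full-scale total, rather than by its own item-total correlation.

```python
import random


def pearson_r(x, y):
    """Pearson correlation between two equal-length sequences (0.0 if degenerate)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / (sxx * syy) ** 0.5


def forward_select(responses, k):
    """Greedily pick k item indices whose summed score best tracks the full total.

    responses: list of rows, one per respondent, each a list of item scores.
    Returns (selected_indices, r2_history), where r2_history[i] is the squared
    correlation between the (i + 1)-item short score and the full-scale total.
    """
    n_items = len(responses[0])
    total = [sum(row) for row in responses]
    selected, history = [], []
    for _ in range(k):
        best_item, best_r2 = None, -1.0
        for j in range(n_items):
            if j in selected:
                continue
            # Wrapper step: score the whole candidate subset, not item j alone.
            candidate = selected + [j]
            short = [sum(row[i] for i in candidate) for row in responses]
            r2 = pearson_r(short, total) ** 2
            if r2 > best_r2:
                best_item, best_r2 = j, r2
        selected.append(best_item)
        history.append(best_r2)
    return selected, history


def simulate(n_people=200, n_items=30, seed=0):
    """Simulated true/false responses driven by one latent trait (assumption)."""
    rng = random.Random(seed)
    data = []
    for _ in range(n_people):
        trait = rng.gauss(0, 1)
        data.append([1 if trait + rng.gauss(0, 1) > 0.5 else 0
                     for _ in range(n_items)])
    return data


if __name__ == "__main__":
    data = simulate()
    items, r2_path = forward_select(data, k=8)
    print("selected items:", items)
    print("share of total-score variance captured:", round(r2_path[-1], 3))
```

The variance figure this sketch prints is computed on the same respondents used for selection, so it is optimistic; a real application would estimate it on held-out data, for example with cross-validation.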

Keywords: Machine learning; SIMS; feigned psychopathology; psychological test; short scale construction.

MeSH terms

Humans
Malingering* / diagnosis
Neuropsychological Tests
Psychometrics
Reproducibility of Results
Self Report
Surveys and Questionnaires