Identifying neuropsychiatric disorders using unsupervised clustering methods: Data and code

Data Brief. 2018 Feb 2;22:570-573. doi: 10.1016/j.dib.2018.01.080. eCollection 2019 Feb.

Abstract

This article provides data for five neuropsychiatric disorders (Attention Deficit Hyperactivity Disorder, Alzheimer's Disease, Autism Spectrum Disorder, Post-Traumatic Stress Disorder, and Post-Concussion Syndrome) along with healthy controls. The data include clinical diagnostic labels, phenotypic variables, and resting-state functional magnetic resonance imaging (fMRI) connectivity features obtained from individuals. In addition, the article provides the MATLAB source code used for the data analyses. The provided code incorporates three existing clustering methods that do not require a priori specification of the number of clusters. A genetic algorithm based feature selection method is also included, which finds the relevant subset of features and clusters the data on that subset simultaneously. Findings from this data set and further detailed interpretations are available in our recent research study (Zhao et al., 2017) [1]. This contribution supports unsupervised machine learning on fMRI data aimed at investigating how well clinical diagnostic groupings correspond to the underlying neurobiological/phenotypic clusters.

Keywords: Clustering; Effective connectivity; Functional connectivity; Functional magnetic resonance imaging; Genetic algorithm; Psychiatric disorders; Unsupervised learning.
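
Illustrative sketch: The abstract describes a genetic algorithm that selects a feature subset while clustering the data on that subset, using clustering methods that do not need the number of clusters in advance. The Python sketch below illustrates that general idea only; it is not the released MATLAB code. Everything in it is an assumption made for illustration: the function names (leader_cluster, silhouette, fitness, ga_select, make_toy_data), the leader-style threshold clustering used as a stand-in for the three methods in the released code, the silhouette width used as the fitness, the GA settings, and the synthetic data.

```python
import math
import random


def leader_cluster(points, radius):
    """Stand-in clustering: a point joins the first leader within `radius`,
    otherwise it starts a new cluster, so the cluster count is not preset."""
    leaders, labels = [], []
    for p in points:
        for k, c in enumerate(leaders):
            if math.dist(p, c) <= radius:
                labels.append(k)
                break
        else:
            leaders.append(p)
            labels.append(len(leaders) - 1)
    return labels


def silhouette(points, labels):
    """Mean silhouette width; -1.0 if there are fewer than two clusters.
    Points in singleton clusters contribute 0."""
    if len(set(labels)) < 2:
        return -1.0
    total = 0.0
    for i, p in enumerate(points):
        sums, counts = {}, {}
        for j, q in enumerate(points):
            if i != j:
                sums[labels[j]] = sums.get(labels[j], 0.0) + math.dist(p, q)
                counts[labels[j]] = counts.get(labels[j], 0) + 1
        own = labels[i]
        if own not in counts:  # singleton cluster
            continue
        a = sums[own] / counts[own]
        b = min(sums[k] / counts[k] for k in counts if k != own)
        total += (b - a) / max(a, b) if max(a, b) > 0 else 0.0
    return total / len(points)


def fitness(data, mask, radius_per_dim=1.0):
    """Cluster on the masked features and score the result (assumed fitness)."""
    cols = [j for j, bit in enumerate(mask) if bit]
    if not cols:
        return -1.0
    sub = [[row[j] for j in cols] for row in data]
    # Scale the radius with sqrt(#features) so subsets of different size compare.
    labels = leader_cluster(sub, radius_per_dim * math.sqrt(len(cols)))
    return silhouette(sub, labels)


def ga_select(data, pop_size=20, generations=15, p_mut=0.1, seed=0):
    """Bitmask GA: tournament selection, uniform crossover, bit-flip mutation,
    and elitism (the best mask always survives)."""
    rng = random.Random(seed)
    n_feat = len(data[0])
    pop = [[rng.randint(0, 1) for _ in range(n_feat)] for _ in range(pop_size)]
    best = max(pop, key=lambda m: fitness(data, m))
    for _ in range(generations):
        scored = [(fitness(data, m), m) for m in pop]
        nxt = [best[:]]
        while len(nxt) < pop_size:
            p1 = max(rng.sample(scored, 3), key=lambda s: s[0])[1]
            p2 = max(rng.sample(scored, 3), key=lambda s: s[0])[1]
            child = [a if rng.random() < 0.5 else b for a, b in zip(p1, p2)]
            child = [1 - g if rng.random() < p_mut else g for g in child]
            nxt.append(child)
        pop = nxt
        best = max(pop, key=lambda m: fitness(data, m))
    return best, fitness(data, best)


def make_toy_data(n_per_group=15, n_noise=4, seed=1):
    """Synthetic data: two groups separated on features 0 and 1, plus noise."""
    rng = random.Random(seed)
    rows = []
    for centre in (-3.0, 3.0):
        for _ in range(n_per_group):
            informative = [rng.gauss(centre, 0.5), rng.gauss(centre, 0.5)]
            rows.append(informative + [rng.gauss(0.0, 1.0) for _ in range(n_noise)])
    return rows
```

Calling ga_select(make_toy_data()) returns a 0/1 mask over the feature columns and the fitness of that mask. In practice the connectivity features from the data set would replace the toy rows, and any of the clustering methods in the released code could take the place of leader_cluster.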