Machine Learning on Early Diagnosis of Depression

Psychiatry Investig. 2022 Aug;19(8):597-605. doi: 10.30773/pi.2022.0075. Epub 2022 Aug 24.

Abstract

To review the recent progress of machine learning for the early diagnosis of depression (major depressive disorder). The source of data was 32 original studies in the Web of Science. The search terms were "depression" (title) and "random forest" (abstract). The eligibility criteria were the dependent variable of depression, the interventions of machine learning (the decision tree, the naïve Bayesian, the random forest, the support vector machine and/or the artificial neural network), the outcomes of accuracy and/or the area under the receiver operating characteristic curve (AUC) for the early diagnosis of depression, the publication year of 2000 or later, the publication language of English and the publication journal of SCIE/SSCI. Different machine learning methods would be appropriate for different types of data for the early diagnosis of depression, e.g., logistic regression, the random forest, the support vector machine and/or the artificial neural network in the case of numeric data, the random forest in the case of genomic data. Their performance measures reported varied within 60.1-100.0 for accuracy and 64.0-96.0 for the AUC. Machine learning provides an effective, non-invasive decision support system for early diagnosis of depression.

Keywords: Depression; Early diagnosis; Machine learning.