[A preliminary prediction model of depression based on whole blood cell count by machine learning method]

Zhonghua Yu Fang Yi Xue Za Zhi. 2023 Nov 6;57(11):1862-1868. doi: 10.3760/cma.j.cn112150-20221202-01169.
[Article in Chinese]

Abstract

This study used machine learning techniques combined with routine blood cell analysis parameters to build preliminary prediction models, helping differentiate patients with depression from healthy controls, or patients with anxiety. A multicenter study was performed by collecting blood cell analysis data of Beijing Chaoyang Hospital and the First Hospital of Hebei Medical University from 2020 to 2021. Machine learning techniques, including support vector machine, decision tree, naïve Bayes, random forest and multi-layer perceptron were explored to establish a prediction model of depression. The results showed that based on the blood cell analysis results of healthy controls and depression group, the accuracy of prediction model reached as high as 0.99, F1 was 0.975. Receiver operating characteristic curve area and average accuracy were 0.985 and 0.967, respectively. Platelet parameters contributed mostly to depression prediction model. While, to random forest differential diagnosis model based on the data from depression and anxiety groups, prediction accuracy reached 0.68 and AUC 0.622. Age, platelet parameters, and average volume of red blood cells contributed the most to the model. In conclusion, the study researched on the prediction model of depression by exploring blood cell analysis parameters, revealing that machine learning models were more objective in the evaluation of mental illness.

通过机器学习算法挖掘常规检验的血细胞分析参数,构建抑郁症诊断及与焦虑症鉴别诊断的预测模型。利用2020—2021年北京朝阳医院和河北医科大学第一医院血细胞分析参数进行研究,收集研究对象22项血细胞分析检验项目及年龄、性别参数,探索采用支持向量机、决策树、朴素贝叶斯、随机森林和多层感知器多种机器学习技术构建抑郁症预测模型。结果显示,基于健康人群和抑郁症组血细胞分析参数,构建的随机森林模型预测抑郁症发生准确率高达0.99,F1为0.975,受试者工作特征曲线下面积和平均精准度分别为0.985和0.967。血小板参数变化是影响抑郁症发生的重要因素。而基于抑郁症组和焦虑症组的血细胞分析数据,构建的随机森林鉴别诊断模型显示出最高的预测准确性(0.68)和AUC(0.622)。年龄和血细胞参数及红细胞平均体积对该模型贡献最大。综上,本研究通过挖掘血细胞分析项目,初步建立了抑郁症预测及与焦虑症的鉴别诊断模型,显示了机器学习模型在精神疾病中更为客观的评估价值。.

Publication types

  • Multicenter Study
  • English Abstract

MeSH terms

  • Bayes Theorem
  • Blood Cell Count
  • Depression*
  • Humans
  • Machine Learning*
  • Support Vector Machine