Identifying depression in the National Health and Nutrition Examination Survey data using a deep learning algorithm

J Affect Disord. 2019 Oct 1:257:623-631. doi: 10.1016/j.jad.2019.06.034. Epub 2019 Jul 4.

Abstract

Background: As depression is the leading cause of disability worldwide, large-scale surveys have been conducted to establish the occurrence and risk factors of depression. However, accurately estimating epidemiological factors leading up to depression has remained challenging. Deep-learning algorithms can be applied to assess the factors leading up to prevalence and clinical manifestations of depression.

Methods: Customized deep-neural-network and machine-learning classifiers were assessed using survey data from 19,725 participants from the NHANES database (from 1999 through 2014) and 4949 from the South Korea NHANES (K-NHANES) database in 2014.

Results: A deep-learning algorithm showed area under the receiver operating characteristic curve (AUCs) of 0.91 and 0.89 for detecting depression in NHANES and K-NHANES, respectively. The deep-learning algorithm trained with serial datasets (NHANES, from 1999 to 2012), predicted the prevalence of depression in the following two years of data (NHANES, 2013 and 2014) with an AUC of 0.92. Machine learning classifiers trained with NHANES could further predict depression in K-NHANES. There, logistic regression had the highest performance (AUC, 0.77) followed by deep learning algorithm (AUC, 0.74).

Conclusions: Deep neural-networks managed to identify depression well from other health and demographic factors in both the NHANES and K-NHANES datasets. The deep-learning algorithm was also able to predict depression relatively well on new data set-cross temporally and cross nationally. Further research can delineate the clinical implications of machine learning and deep learning in detecting disease prevalence and progress as well as other risk factors for depression and other mental illnesses.

Keywords: Deep learning; Depression; Machine learning; National Health and Nutrition Examination Survey.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adult
  • Algorithms*
  • Databases, Factual
  • Deep Learning*
  • Depression / epidemiology*
  • Female
  • Humans
  • Machine Learning
  • Male
  • Middle Aged
  • Neural Networks, Computer
  • Nutrition Surveys
  • ROC Curve
  • Republic of Korea
  • Risk Factors