Genetic prediction of type 2 diabetes using deep neural network

Clin Genet. 2018 Apr;93(4):822-829. doi: 10.1111/cge.13175. Epub 2018 Feb 20.

Abstract

Type 2 diabetes (T2DM) has strong heritability but genetic models to explain heritability have been challenging. We tested deep neural network (DNN) to predict T2DM using the nested case-control study of Nurses' Health Study (3326 females, 45.6% T2DM) and Health Professionals Follow-up Study (2502 males, 46.5% T2DM). We selected 96, 214, 399, and 678 single-nucleotide polymorphism (SNPs) through Fisher's exact test and L1-penalized logistic regression. We split each dataset randomly in 4:1 to train prediction models and test their performance. DNN and logistic regressions showed better area under the curve (AUC) of ROC curves than the clinical model when 399 or more SNPs included. DNN was superior than logistic regressions in AUC with 399 or more SNPs in male and 678 SNPs in female. Addition of clinical factors consistently increased AUC of DNN but failed to improve logistic regressions with 214 or more SNPs. In conclusion, we show that DNN can be a versatile tool to predict T2DM incorporating large numbers of SNPs and clinical information. Limitations include a relatively small number of the subjects mostly of European ethnicity. Further studies are warranted to confirm and improve performance of genetic prediction models using DNN in different ethnic groups.

Keywords: deep neural network; genomic microarray; prediction model; type 2 diabetes.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Diabetes Mellitus, Type 2 / genetics*
  • Diabetes Mellitus, Type 2 / metabolism
  • Diabetes Mellitus, Type 2 / physiopathology
  • Female
  • Follow-Up Studies
  • Genetic Association Studies / statistics & numerical data
  • Genetic Predisposition to Disease*
  • Humans
  • Logistic Models
  • Male
  • Models, Genetic
  • Neural Networks, Computer
  • Polymorphism, Single Nucleotide / genetics