Deep Learning Framework for Complex Disease Risk Prediction Using Genomic Variations

Sensors (Basel). 2023 May 1;23(9):4439. doi: 10.3390/s23094439.

Abstract

Genome-wide association studies have proven their ability to improve human health outcomes by identifying genotypes associated with phenotypes. Various works have attempted to predict the risk of diseases for individuals based on genotype data. This prediction can either be considered as an analysis model that can lead to a better understanding of gene functions that underlie human disease or as a black box in order to be used in decision support systems and in early disease detection. Deep learning techniques have gained more popularity recently. In this work, we propose a deep-learning framework for disease risk prediction. The proposed framework employs a multilayer perceptron (MLP) in order to predict individuals' disease status. The proposed framework was applied to the Wellcome Trust Case-Control Consortium (WTCCC), the UK National Blood Service (NBS) Control Group, and the 1958 British Birth Cohort (58C) datasets. The performance comparison of the proposed framework showed that the proposed approach outperformed the other methods in predicting disease risk, achieving an area under the curve (AUC) up to 0.94.

Keywords: GWAS; SNP; complex diseases risk prediction; feature selection; machine learning; mutual information.

MeSH terms

  • Deep Learning*
  • Genome-Wide Association Study
  • Genomics
  • Genotype
  • Humans
  • Neural Networks, Computer

Grants and funding

This work was supported through the Annual Funding track by the Deanship of Scientific Research, Vice Presidency for Graduate Studies and Scientific Research, King Faisal University, Saudi Arabia (Project No. GRANT 3306).