Simplified detection of genetic background admixture using artificial intelligence

Clin Genet. 2024 Apr 1. doi: 10.1111/cge.14527. Online ahead of print.

Abstract

Admixture refers to the mixing of genetic ancestry from different populations. Admixture is important for genomic medicine because it can affect how an individual responds to certain medications, how they metabolize drugs, and susceptibility to certain diseases. For example, some genetic variants associated with drug metabolism and response may be more common in certain populations, and individuals with admixed ancestry may have a different frequency of these variants than individuals from the ancestral populations. Understanding the patterns of admixture in a population can also help researchers identify new genetic variants associated with diseases or traits and develop more personalized and targeted treatments. In this study, we compared and classified the known and self-reported genetic backgrounds from 1000 Genomes Project and admixed samples from GTEx projects using supervised, unsupervised and statistical classification methodologies. We developed a novel tool called Admix-AI that uses a one-dimensional convolutional neural network to understand and classify admixed genetic backgrounds using 213 DNA-marker based genetic background labels. Admix-AI can be used to discover admixed proportions in samples and ultimately aid personalized genomic medicine by identifying specific biomarker systems. We compared Admix-AI to the existing admixture categorization software and found our tool to be computationally faster with 2× speedup and streamlined usage. Admix-AI is available as open-source code under GPL version 3.0 license at https://github.com/rpauly/Admix-AI.

Keywords: admixture; artificial intelligence; diversity; genetics.