A multiple coefficient of determination-based method for parsing SNPs that correlate with mRNA expression

Sci Rep. 2019 Dec 27;9(1):20110. doi: 10.1038/s41598-019-56494-9.

Abstract

In this study, we present a novel, multiple coefficient of determination (R2M)-based method for parsing SNPs located within the chromosomal neighborhood of a gene into semi-independent families, each of which corresponds to one or more functional variants that regulate transcription of the gene. Specifically, our method utilizes a matrix equation framework to calculate R2M values for SNPs within a chromosome region of interest (ROI) based upon the choices of 1-4 "index" SNPs (iSNPs) that serve as proxies for underlying regulatory variants. Exhaustive testing of sets of 1-4 candidate iSNPs identifies iSNP models that best account for estimated R2 values derived from single-variable linear regression analysis of correlations between mRNA expression and genotypes of individual SNPs. Subsequent genotype-based estimation of pairwise r2 linkage disequilibrium (LD) coefficients between each iSNP and the other ROI SNPs allows the SNPs to be parsed into semi-independent families. Analysis of mRNA expression and genotypes data downloaded from Gene Expression Omnibus (GEO) and database for Genotypes and Phenotypes (dbGAP) demonstrates the usefulness of this method for parsing SNPs based on experimental data. We believe that this method will be widely applicable for the analysis of the genetic basis of mRNA expression and visualizing the contributions of multiple genetic variants to the regulation of individual genes.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Brain / metabolism
  • Cell Line
  • Computational Biology
  • Gene Expression Profiling
  • Genetic Association Studies* / methods
  • Genetic Predisposition to Disease*
  • Genotype
  • Humans
  • Methylenetetrahydrofolate Reductase (NADPH2) / genetics
  • Polymorphism, Single Nucleotide*
  • RNA, Messenger*
  • Transcriptome

Substances

  • RNA, Messenger
  • MTHFR protein, human
  • Methylenetetrahydrofolate Reductase (NADPH2)