A weighted empirical Bayes risk prediction model using multiple traits

Stat Appl Genet Mol Biol. 2020 Sep 4;19(3):/j/sagmb.2020.19.issue-3/sagmb-2019-0056/sagmb-2019-0056.xml. doi: 10.1515/sagmb-2019-0056.

Abstract

With rapid advances in high-throughput sequencing technology, millions of single-nucleotide variants (SNVs) can be simultaneously genotyped in a sequencing study. These SNVs residing in functional genomic regions such as exons may play a crucial role in biological process of the body. In particular, non-synonymous SNVs are closely related to the protein sequence and its function, which are important in understanding the biological mechanism of sequence evolution. Although statistically challenging, models incorporating such SNV annotation information can improve the estimation of genetic effects, and multiple responses may further strengthen the signals of these variants on the assessment of disease risk. In this work, we develop a new weighted empirical Bayes method to integrate SNV annotation information in a multi-trait design. The performance of this proposed model is evaluated in simulation as well as a real sequencing data; thus, the proposed method shows improved prediction accuracy compared to other approaches.

Keywords: empirical Bayes; next-generation sequencing; rare variants; risk prediction.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Bayes Theorem
  • Computer Simulation
  • Databases, Genetic
  • Genomics / methods*
  • High-Throughput Nucleotide Sequencing
  • Humans
  • Models, Statistical
  • Polymorphism, Single Nucleotide
  • Quantitative Trait Loci
  • Risk Assessment / methods