Using Stochastic Approximation Techniques to Efficiently Construct Confidence Intervals for Heritability

Regev Schweiger; Eyal Fisher; Elior Rahmani; Liat Shenhav; Saharon Rosset; Eran Halperin

doi:10.1089/cmb.2018.0047

Using Stochastic Approximation Techniques to Efficiently Construct Confidence Intervals for Heritability

J Comput Biol. 2018 Jul;25(7):794-808. doi: 10.1089/cmb.2018.0047. Epub 2018 Jun 22.

Authors

Regev Schweiger¹, Eyal Fisher², Elior Rahmani¹, Liat Shenhav², Saharon Rosset², Eran Halperin^{3

4}

Affiliations

¹ 1 Blavatnik School of Computer Science, Tel Aviv University , Tel Aviv, Israel .
² 2 Department of Statistics, School of Mathematical Sciences, Tel Aviv University , Tel Aviv, Israel .
³ 3 Department of Computer Science, University of California , Los Angeles, California.
⁴ 4 Department of Anesthesiology and Perioperative Medicine, University of California , Los Angeles, California.

PMID: 29932739
DOI: 10.1089/cmb.2018.0047

Abstract

Estimation of heritability is an important task in genetics. The use of linear mixed models (LMMs) to determine narrow-sense single-nucleotide polymorphism (SNP)-heritability and related quantities has received much recent attention, due of its ability to account for variants with small effect sizes. Typically, heritability estimation under LMMs uses the restricted maximum likelihood (REML) approach. The common way to report the uncertainty in REML estimation uses standard errors (SEs), which rely on asymptotic properties. However, these assumptions are often violated because of the bounded parameter space, statistical dependencies, and limited sample size, leading to biased estimates and inflated or deflated confidence intervals (CIs). In addition, for larger data sets (e.g., tens of thousands of individuals), the construction of SEs itself may require considerable time, as it requires expensive matrix inversions and multiplications. Here, we present FIESTA (Fast confidence IntErvals using STochastic Approximation), a method for constructing accurate CIs. FIESTA is based on parametric bootstrap sampling, and, therefore, avoids unjustified assumptions on the distribution of the heritability estimator. FIESTA uses stochastic approximation techniques, which accelerate the construction of CIs by several orders of magnitude, compared with previous approaches as well as to the analytical approximation used by SEs. FIESTA builds accurate CIs rapidly, for example, requiring only several seconds for data sets of tens of thousands of individuals, making FIESTA a very fast solution to the problem of building accurate CIs for heritability for all data set sizes.

Keywords: confidence intervals; heritability; stochastic approximation.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Computer Simulation
Genome-Wide Association Study / statistics & numerical data*
Genotype
Humans
Models, Statistical*
Phenotype
Polymorphism, Single Nucleotide / genetics
Quantitative Trait Loci / genetics*
Software

Grants and funding

076113/WT_/Wellcome Trust/United Kingdom