rBahadur: efficient simulation of structured high-dimensional genotype data with applications to assortative mating

BMC Bioinformatics. 2023 Aug 18;24(1):314. doi: 10.1186/s12859-023-05442-6.

Abstract

Existing methods for generating synthetic genotype data are ill-suited for replicating the effects of assortative mating (AM). We propose rb_dplr, a novel and computationally efficient algorithm that uses the Bahadur order-2 approximation of the multivariate Bernoulli distribution to generate high-dimensional binary random variates that recapitulate AM-induced genetic architectures. The rBahadur R library is available through the Comprehensive R Archive Network at https://CRAN.R-project.org/package=rBahadur.
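To make the Bahadur order-2 approximation concrete, the following is a minimal Python sketch, not the rb_dplr algorithm and not the rBahadur API. It assumes a handful of binary variables with marginal probabilities p and pairwise correlations R, and it enumerates all 2^m outcomes by brute force, which is feasible only for very small m. The function name bahadur2_pmf and all parameter values are hypothetical and chosen for illustration. The order-2 form multiplies the independent-Bernoulli probability of each outcome by 1 + sum over pairs i < j of r_ij z_i z_j, where z_i = (x_i - p_i) / sqrt(p_i (1 - p_i)).

```python
import itertools
import numpy as np

def bahadur2_pmf(p, R):
    """Order-2 Bahadur pmf over all 2^m binary vectors (small m only).

    p : marginal success probabilities, length m (illustrative input)
    R : m x m correlation matrix; only the upper triangle is used
    """
    p = np.asarray(p, dtype=float)
    R = np.asarray(R, dtype=float)
    m = len(p)
    # Every binary outcome, one per row.
    X = np.array(list(itertools.product([0, 1], repeat=m)), dtype=float)
    # Probability of each outcome if the variables were independent.
    base = np.prod(np.where(X == 1, p, 1 - p), axis=1)
    # Standardized outcomes z_i = (x_i - p_i) / sqrt(p_i (1 - p_i)).
    Z = (X - p) / np.sqrt(p * (1 - p))
    # Pairwise correction: sum over i < j of r_ij * z_i * z_j.
    i, j = np.triu_indices(m, k=1)
    correction = (Z[:, i] * Z[:, j]) @ R[i, j]
    pmf = base * (1 + correction)
    # The order-2 truncation is a valid distribution only if no term is negative.
    if np.any(pmf < 0):
        raise ValueError("correlations too strong for a valid order-2 Bahadur pmf")
    return X, pmf

# Hypothetical example: three variables with weak pairwise correlations.
p = np.array([0.2, 0.5, 0.7])
R = np.array([[1.00, 0.10, 0.05],
              [0.10, 1.00, -0.10],
              [0.05, -0.10, 1.00]])
X, pmf = bahadur2_pmf(p, R)

# Draw samples by picking rows of X according to the pmf.
rng = np.random.default_rng(0)
draws = X[rng.choice(len(pmf), size=1000, p=pmf)]
```

Because the correction terms have mean zero under independence, the resulting distribution keeps the marginals p and reproduces the requested pairwise correlations whenever it is non-negative. Enumeration scales as 2^m, so this sketch only illustrates the distribution itself and is not a way to simulate high-dimensional genotype data.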

Keywords: Assortative mating; Genotype simulation; Multivariate Bernoulli.

MeSH terms

  • Algorithms*
  • Binomial Distribution
  • Cell Communication*
  • Computer Simulation
  • Genotype