Improved computations for relationship inference using low-coverage sequencing data

BMC Bioinformatics. 2023 Mar 9;24(1):90. doi: 10.1186/s12859-023-05217-z.

Abstract

Pedigree inference, for example determining whether two persons are second cousins or unrelated, can be done by comparing their genotypes at a selection of genetic markers. When the data for one or more of the persons is from low-coverage next generation sequencing (lcNGS), currently available computational methods either ignore genetic linkage or do not take advantage of the probabilistic nature of lcNGS data, relying instead on first estimating the genotype. We provide a method and software (see familias.name/lcNGS) bridging the above gap. Simulations indicate how our results are considerably more accurate compared to some previously available alternatives. Our method, utilizing a version of the Lander-Green algorithm, uses a group of symmetries to speed up calculations. This group may be of further interest in other calculations involving linked loci.

Keywords: Bayesian; LcNGS; Pedigree inference.

MeSH terms

  • Algorithms*
  • Genetic Linkage
  • Genotype
  • Models, Genetic
  • Pedigree
  • Polymorphism, Single Nucleotide
  • Software*