Covariance Between Genotypic Effects and its Use for Genomic Inference in Half-Sib Families

G3 (Bethesda). 2016 Sep 8;6(9):2761-72. doi: 10.1534/g3.116.032409.

Abstract

In livestock, current statistical approaches utilize extensive molecular data, e.g., single nucleotide polymorphisms (SNPs), to improve the genetic evaluation of individuals. The number of model parameters increases with the number of SNPs, so the multicollinearity between covariates can affect the results obtained using whole genome regression methods. In this study, dependencies between SNPs due to linkage and linkage disequilibrium among the chromosome segments were explicitly considered in methods used to estimate the effects of SNPs. The population structure affects the extent of such dependencies, so the covariance among SNP genotypes was derived for half-sib families, which are typical in livestock populations. Conditional on the SNP haplotypes of the common parent (sire), the theoretical covariance was determined using the haplotype frequencies of the population from which the individual parent (dam) was derived. The resulting covariance matrix was included in a statistical model for a trait of interest, and this covariance matrix was then used to specify prior assumptions for SNP effects in a Bayesian framework. The approach was applied to one family in simulated scenarios (few and many quantitative trait loci) and using semireal data obtained from dairy cattle to identify genome segments that affect performance traits, as well as to investigate the impact on predictive ability. Compared with a method that does not explicitly consider any of the relationship among predictor variables, the accuracy of genetic value prediction was improved by 10-22%. The results show that the inclusion of dependence is particularly important for genomic inference based on small sample sizes.

Keywords: Bayesian statistics; SNP effect; autoregressive prior; linkage disequilibrium; recombination rate.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Bayes Theorem
  • Cattle
  • Genetic Linkage*
  • Genetics, Population
  • Genome / genetics*
  • Genomics
  • Genotype
  • Haplotypes
  • Linkage Disequilibrium
  • Models, Genetic
  • Pedigree
  • Polymorphism, Single Nucleotide / genetics*
  • Quantitative Trait Loci / genetics*
  • Siblings