Required properties for markers used to calculate unbiased estimates of the genetic correlation between populations

Genet Sel Evol. 2018 Dec 14;50(1):65. doi: 10.1186/s12711-018-0434-6.

Abstract

Background: Generally, populations differ in terms of environmental and genetic factors, which can create differences in allele substitution effects between populations. Therefore, a single genotype may have different additive genetic values in different populations. The correlation between the two additive genetic values of a single genotype in two populations is known as the additive genetic correlation between populations and thus, can differ from 1. Our objective was to investigate whether differences in linkage disequilibrium (LD) and allele frequencies of markers and causal loci between populations affect the bias of the estimated genetic correlation. We simulated two populations that were separated by 50 generations and differed in LD pattern between markers and causal loci, as measured by the LD-statistic r. We used a high marker density to represent a high consistency of LD between populations, and lower marker densities to represent situations with a lower consistency of LD between populations. Markers and causal loci were selected to have either similar or different allele frequencies in the two populations.

Results: Our results show that genetic correlations were underestimated only slightly when the difference in allele frequencies between the two populations was similar for the markers and the causal loci. A lower marker density, representing a lower consistency of LD between populations, had only a minor effect on the underestimation of the genetic correlation. When the difference in allele frequencies between the two populations was not similar for markers and causal loci, genetic correlations were severely underestimated. This bias occurred because the markers did not predict accurately the relationships at causal loci.

Conclusions: For an unbiased estimation of the genetic correlation between populations, the markers should accurately predict the relationships at the causal loci. To achieve this, it is essential that the difference in allele frequencies between populations is similar for markers and causal loci. Our results show that differences in LD phase between causal loci and markers across populations have little effect on the estimated genetic correlation.

MeSH terms

  • Alleles
  • Bias
  • Biomarkers
  • Computer Simulation
  • Gene Frequency / genetics
  • Genetic Markers / genetics*
  • Genetics, Population / methods*
  • Genetics, Population / statistics & numerical data
  • Genotype
  • Linkage Disequilibrium / genetics*
  • Polymorphism, Single Nucleotide / genetics

Substances

  • Biomarkers
  • Genetic Markers