Imputation of missing genotypes from low- to high-density SNP panel in different population designs

Anim Genet. 2015 Feb;46(1):1-7. doi: 10.1111/age.12236. Epub 2014 Nov 28.

Abstract

Imputation of missing genotypes, in particular from low density to high density, is an important issue in genomic selection and genome-wide association studies. Given the marker densities, the most important factors affecting imputation accuracy are the size of the reference population and the relationship between individuals in the reference (genotyped with high-density panel) and study (genotyped with low-density panel) populations. In this study, we investigated the imputation accuracies when the reference population (genotyped with Illumina BovineSNP50 SNP panel) contained sires, halfsibs, or both sires and halfsibs of the individuals in the study population (genotyped with Illumina BovineLD SNP panel) using three imputation programs (fimpute v2.2, findhap v2, and beagle v3.3.2). Two criteria, correlation between true and imputed genotypes and missing rate after imputation, were used to evaluate the performance of the three programs in different scenarios. Our results showed that fimpute performed the best in all cases, with correlations from 0.921 to 0.978 when imputing from sires to their daughters or between halfsibs. In general, the accuracies of imputing between halfsibs or from sires to their daughters were higher than were those imputing between non-halfsibs or from sires to non-daughters. Including both sires and halfsibs in the reference population did not improve the imputation performance in comparison with when only including halfsibs in the reference population for all the three programs.

Keywords: low density; reference population size; relationship.

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Breeding
  • Cattle / genetics*
  • Female
  • Genetic Association Studies
  • Genotype*
  • Genotyping Techniques*
  • Male
  • Polymorphism, Single Nucleotide*
  • Population Density
  • Reference Values
  • Software*