Evidence of rare misassemblies in the bovine reference genome revealed by population genetic metrics

Anim Genet. 2022 Aug;53(4):498-505. doi: 10.1111/age.13205. Epub 2022 May 1.

Abstract

Creation of the bovine reference assembly paved the way to develop the high-throughput genotyping arrays of the single nucleotide polymorphisms (SNPs) based on the available map coordinates that facilitated major advances in gene mapping and selection programs. The assembly flaws, however, may cause false results in the downstream gene mapping studies. The most recent bovine reference genome (ARS-UCD1.2) is built on the long-read sequences that provides improved quality and continuity. By applying population genetic metrics in this study, we aimed to evaluate the map coordinates to which SNP markers were assigned. We employed a three-step approach by combining the recombination and linkage disequilibrium analyses to test if the markers fit into the assigned physical map coordinates. We applied the method to the bovine 50k array in a large pedigree of Holstein cattle and revealed a panel of 65 candidate markers, most of which were re-located either on a different chromosome or re-mapped as far as several million base pairs away on the same chromosome. This list of candidates accounts for 0.1% of the SNPs in the widely used 50k genotyping array and we foresee a reasonably larger set of markers being misplaced in the BovineHD 700K BeadChip. We suggest pre-removal of the candidate misplaced markers to reduce false signals in association mapping studies.

Keywords: linkage disequilibrium; misplaced marker; recombination.

MeSH terms

  • Animals
  • Cattle / genetics
  • Chromosome Mapping
  • Genetic Linkage
  • Genetics, Population*
  • Pedigree
  • Polymorphism, Single Nucleotide*