Inferring bio-geographical ancestry with 35 microhaplotypes

Forensic Sci Int. 2022 Dec:341:111509. doi: 10.1016/j.forsciint.2022.111509. Epub 2022 Nov 1.

Abstract

During the past decades, with the blooming of large-scale human genetic studies, we are beginning to understand how bio-geographical information could be reflected by genetic variations. And the technological advance in massively parallel sequencing gives advantages to some novel forensic markers such as microhaplotype (BIM). In the present study, we selected and characterized 35 novel bio-geographical informative BIMs based on the 1000 Genomes Project (1KG). All loci had short lengths less than 100 bp, high effective allele numbers (Ae) from 1.875 to 3.980 with an average of 2.798, and high informativeness (In) value from 0.701 to 0.865 with an average of 0.748, which indicates the 35 BIMs possessed great discriminating power. Using these 35 BIMs, the individuals from 1KG could be successfully differentiated into five supergroups defined by 1KG (AFR, AMR, EAS, EUR, and SAS). We also included some individuals from the Simons Genome Diversity Project (SGDP) for further validation. As a result, most individuals could be accurately predicted except for those from super-populations that do not exist in 1KG dataset. In conclusion, the present novel 35 BIMs could be a useful tool for bio-geographical ancestry inference.

Keywords: Ancestry inference; Bio-geographic ancestry; Forensic marker; Microhaplotype; Population structure.

MeSH terms

  • Gene Frequency
  • Genetics, Population*
  • Haplotypes
  • High-Throughput Nucleotide Sequencing
  • Humans
  • Polymorphism, Single Nucleotide*