Protocol to analyze population structure and migration history based on human genome variation data

STAR Protoc. 2023 Mar 17;4(1):101928. doi: 10.1016/j.xpro.2022.101928. Epub 2022 Dec 13.

Abstract

We describe a protocol to integrate genome variation data from different datasets and explore the population structure and migration history of human populations. This protocol provides semi-automated scripts to perform and visualize the effect of variant filtering strategy on eliminating batch effects, principal component analysis, ancestry component analysis, historical population effective size inference, and migration and isolation analysis based on independent biallelic SNPs, genotype likelihoods, and haplotypes. The protocol can be adapted to variation data from other sources. For complete details on the use and execution of this protocol, please refer to Zhang et al. (2022).1.

Keywords: Bioinformatics; Computer sciences; Evolutionary biology; Genomics.

MeSH terms

  • Genome, Human* / genetics
  • Genotype
  • Humans
  • Polymorphism, Single Nucleotide* / genetics
  • Probability