We describe a protocol to integrate genome variation data from different datasets and explore the population structure and migration history of human populations. This protocol provides semi-automated scripts to perform and visualize the effect of variant filtering strategy on eliminating batch effects, principal component analysis, ancestry component analysis, historical population effective size inference, and migration and isolation analysis based on independent biallelic SNPs, genotype likelihoods, and haplotypes. The protocol can be adapted to variation data from other sources. For complete details on the use and execution of this protocol, please refer to Zhang et al. (2022).1.
Keywords: Bioinformatics; Computer sciences; Evolutionary biology; Genomics.
Copyright © 2022 The Authors. Published by Elsevier Inc. All rights reserved.