Power of a dual-use SNP panel for pedigree reconstruction and population assignment

Ecol Evol. 2020 Aug 10;10(17):9522-9531. doi: 10.1002/ece3.6645. eCollection 2020 Sep.

Abstract

The use of high-throughput, low-density sequencing approaches has dramatically increased in recent years in studies of eco-evolutionary processes in wild populations and domestication in commercial aquaculture. Most of these studies focus on identifying panels of SNP loci for a single downstream application, whereas there have been few studies examining the trade-offs for selecting panels of markers for use in multiple applications. Here, we detail the use of a bioinformatic workflow for the development of a dual-purpose SNP panel for parentage and population assignment, which included identifying putative SNP loci, filtering for the most informative loci for the two tasks, designing effective multiplex PCR primers, optimizing the SNP panel for performance, and performing quality control steps for downstream applications. We applied this workflow to two adjacent Alaskan Sockeye Salmon populations and identified a GTseq panel of 142 SNP loci for parentage and 35 SNP loci for population assignment. Only 50-75 panel loci were necessary for >95% accurate parentage, whereas population assignment success, with all 172 panel loci, ranged from 93.9% to 96.2%. Finally, we discuss the trade-offs and complexities of the decision-making process that drives SNP panel development, optimization, and testing.

Keywords: GTseq; Sockeye; amplicon panel; parentage; population assignment.