Targeted resequencing of coding DNA sequences for SNP discovery in nonmodel species

Mol Ecol Resour. 2018 Nov;18(6):1356-1373. doi: 10.1111/1755-0998.12924. Epub 2018 Jul 30.

Abstract

Targeted capture coupled with high-throughput sequencing can be used to gain information about nuclear sequence variation at hundreds to thousands of loci. Divergent reference capture makes use of molecular data of one species to enrich target loci in other (related) species. This is particularly valuable for nonmodel organisms, for which often no a priori knowledge exists regarding these loci. Here, we have used targeted capture to obtain data for 809 nuclear coding DNA sequences (CDS) in a nonmodel organism, the Eurasian lynx Lynx lynx, using baits designed with the help of the published genome of a related model organism (the domestic cat Felis catus). Using this approach, we were able to survey intraspecific variation at hundreds of nuclear loci in L. lynx across the species' European range. A large set of biallelic candidate SNPs was then evaluated using a high-throughput SNP genotyping platform (Fluidigm), which we then reduced to a final 96 SNP-panel based on assay performance and reliability; validation was carried out with 100 additional Eurasian lynx samples not included in the SNP discovery phase. The 96 SNP-panel developed from CDS performed very successfully in the identification of individuals and in population genetic structure inference (including the assignment of individuals to their source population). In keeping with recent studies, our results show that genic SNPs can be valuable for genetic monitoring of wildlife species.

Keywords: CDS; Eurasian lynx; conservation genetics; genetic monitoring; hybridization capture; single nucleotide polymorphism.

MeSH terms

  • Animals
  • Cats / genetics
  • Computational Biology / methods*
  • Genotype
  • Genotyping Techniques / methods*
  • Lynx / classification*
  • Lynx / genetics*
  • Polymorphism, Single Nucleotide*
  • Sequence Analysis, DNA / methods*

Associated data

  • GENBANK/SRP116616