A next-generation sequencing method for genotyping-by-sequencing of highly heterozygous autotetraploid potato

PLoS One. 2013 May 8;8(5):e62355. doi: 10.1371/journal.pone.0062355. Print 2013.

Abstract

Assessment of genomic DNA sequence variation and genotype calling in autotetraploids implies the ability to distinguish among five possible alternative allele copy number states. This study demonstrates the accuracy of genotyping-by-sequencing (GBS) of a large collection of autotetraploid potato cultivars using next-generation sequencing. It is still costly to reach sufficient read depths on a genome-wide scale across the cultivated gene pool. Therefore, we enriched cultivar-specific DNA sequencing libraries using an in-solution hybridisation method (SureSelect). This complexity reduction allowed us to confine our study to 807 target genes distributed across the genomes of 83 tetraploid cultivars and one reference (DM 1-3 511). Indexed sequencing libraries were paired-end sequenced in 7 pools of 12 samples using Illumina HiSeq2000. After filtering and processing the raw sequence data, 12.4 Gigabases of high-quality sequence data was obtained, which mapped to 2.1 Mb of the potato reference genome, with a median average read depth of 63× per cultivar. We detected 129,156 sequence variants and genotyped the allele copy number of each variant for every cultivar. In this cultivar panel a variant density of 1 SNP/24 bp in exons and 1 SNP/15 bp in introns was obtained. The average minor allele frequency (MAF) of a variant was 0.14. Potato germplasm displayed a large number of relatively rare variants and/or haplotypes, with 61% of the variants having a MAF below 0.05. A very high average nucleotide diversity (π = 0.0107) was observed. Nucleotide diversity varied among potato chromosomes. Several genes under selection were identified. Genotyping-by-sequencing results, with allele copy number estimates, were validated with a KASP genotyping assay. This validation showed that read depths of ∼60-80× can be used as a lower boundary for reliable assessment of allele copy number of sequence variants in autotetraploids. Genotypic data were associated with traits, and alleles strongly influencing maturity and flesh colour were identified.
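
The five allele copy number states mentioned in the abstract can be illustrated with a minimal Python sketch. It is not the authors' genotype-calling method: it assumes a simple binomial read-count model with a fixed sequencing error rate, and the names `call_dosage`, `minor_allele_frequency`, and `error_rate` are hypothetical. The sketch picks the most likely alternative-allele dosage (0 to 4) for one sample and then derives a minor allele frequency from a panel of dosages.

```python
from math import log

PLOIDY = 4  # autotetraploid: dosage of the alternative allele is 0, 1, 2, 3 or 4


def call_dosage(alt_reads, total_reads, error_rate=0.01):
    """Return the most likely alternative-allele copy number (0..4).

    Assumption: reads are binomial draws with success probability equal to
    the dosage fraction, adjusted by a fixed per-read error rate (must be > 0).
    """
    ref_reads = total_reads - alt_reads
    best_dosage, best_loglik = 0, float("-inf")
    for dosage in range(PLOIDY + 1):
        fraction = dosage / PLOIDY
        # Expected alt-read fraction once sequencing errors are allowed for
        p = fraction * (1 - error_rate) + (1 - fraction) * error_rate
        # The binomial coefficient is the same for every dosage, so it is omitted
        loglik = alt_reads * log(p) + ref_reads * log(1 - p)
        if loglik > best_loglik:
            best_dosage, best_loglik = dosage, loglik
    return best_dosage


def minor_allele_frequency(dosages):
    """Minor allele frequency across samples, given per-sample dosages."""
    alt_freq = sum(dosages) / (PLOIDY * len(dosages))
    return min(alt_freq, 1 - alt_freq)


if __name__ == "__main__":
    # 15 alternative reads out of 60 is closest to the 1-in-4 (simplex) state
    print(call_dosage(15, 60))
    print(minor_allele_frequency([0, 1, 0, 4]))
```

Under this simplified model, neighbouring states such as 1/4 and 2/4 become harder to separate as the read count drops, which is consistent with the abstract's point that a minimum read depth is needed for reliable copy number assessment.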

Publication types

  • Research Support, Non-U.S. Gov't
  • Validation Study

MeSH terms

  • Gene Frequency
  • Gene Library
  • Genetic Variation*
  • Genome, Plant / genetics*
  • Genotype
  • Heterozygote
  • High-Throughput Nucleotide Sequencing / methods*
  • Solanum tuberosum / genetics*
  • Tetraploidy*

Grants and funding

This research was supported by a grant from the Dutch technology foundation STW, project WPB-7926. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.