Validation of Genotyping-By-Sequencing Analysis in Populations of Tetraploid Alfalfa by 454 Sequencing

PLoS One. 2015 Jun 26;10(6):e0131918. doi: 10.1371/journal.pone.0131918. eCollection 2015.

Abstract

Genotyping-by-sequencing (GBS) is a relatively low-cost high throughput genotyping technology based on next generation sequencing and is applicable to orphan species with no reference genome. A combination of genome complexity reduction and multiplexing with DNA barcoding provides a simple and affordable way to resolve allelic variation between plant samples or populations. GBS was performed on ApeKI libraries using DNA from 48 genotypes each of two heterogeneous populations of tetraploid alfalfa (Medicago sativa spp. sativa): the synthetic cultivar Apica (ATF0) and a derived population (ATF5) obtained after five cycles of recurrent selection for superior tolerance to freezing (TF). Nearly 400 million reads were obtained from two lanes of an Illumina HiSeq 2000 sequencer and analyzed with the Universal Network-Enabled Analysis Kit (UNEAK) pipeline designed for species with no reference genome. Following the application of whole dataset-level filters, 11,694 single nucleotide polymorphism (SNP) loci were obtained. About 60% had a significant match on the Medicago truncatula syntenic genome. The accuracy of allelic ratios and genotype calls based on GBS data was directly assessed using 454 sequencing on a subset of SNP loci scored in eight plant samples. Sequencing depth in this study was not sufficient for accurate tetraploid allelic dosage, but reliable genotype calls based on diploid allelic dosage were obtained when using additional quality filtering. Principal Component Analysis of SNP loci in plant samples revealed that a small proportion (<5%) of the genetic variability assessed by GBS is able to differentiate ATF0 and ATF5. Our results confirm that analysis of GBS data using UNEAK is a reliable approach for genome-wide discovery of SNP loci in outcrossed polyploids.

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't
  • Validation Study

MeSH terms

  • Alleles
  • Base Sequence
  • Chromosome Mapping / methods
  • DNA Mutational Analysis / methods
  • Genome, Plant
  • Genotyping Techniques / methods*
  • High-Throughput Nucleotide Sequencing* / methods
  • Medicago sativa / classification
  • Medicago sativa / genetics*
  • Medicago truncatula / genetics
  • Molecular Sequence Data
  • Polymorphism, Single Nucleotide
  • Sequence Homology, Nucleic Acid
  • Tetraploidy*

Grants and funding

This project was supported by a competitive grant of Agriculture and Agri-Food Canada to YC and by a grant from the “Centre Sève“ of the “Fonds de recherche du Québec – Nature et technologies” to YC and FB. The participation of SR to this project was supported by a Natural Sciences and Engineering Research Council of Canada visiting fellowship in Canadian federal government laboratories. The founders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.