Genome-wide microsatellite identification in the fungus Anisogramma anomala using Illumina sequencing and genome assembly

PLoS One. 2013 Nov 27;8(11):e82408. doi: 10.1371/journal.pone.0082408. eCollection 2013.

Abstract

High-throughput sequencing has dramatically accelerated the discovery of microsatellite markers (also known as simple sequence repeats, SSRs). Both 454 and Illumina reads have been used directly for microsatellite discovery and primer design (the "Seq-to-SSR" approach). However, this approach has two constraints: 1) many microsatellite-containing reads lack sufficient flanking sequence for primer design, and 2) microsatellite loci residing in longer, repetitive regions are difficult to remove. In the current study, we applied a novel "Seq-Assembly-SSR" approach to overcome these constraints in Anisogramma anomala. In this approach, Illumina reads were first assembled into a draft genome, which was then used for microsatellite discovery. A. anomala is an obligate biotrophic ascomycete that causes eastern filbert blight disease of commercial European hazelnut. Little is known about its population structure or diversity. Approximately 26 million 146-bp Illumina reads were generated from a paired-end library of a fungal strain from Oregon. The reads were assembled into a draft genome of 333 Mb (excluding gaps), with a contig N50 of 10,384 bp and a scaffold N50 of 32,987 bp. A bioinformatics pipeline identified 46,677 microsatellite motifs at 44,247 loci, including 2,430 compound loci. Primers were successfully designed for 42,923 loci (97%). After removing 2,886 loci close to assembly gaps and 676 loci in repetitive regions, a genome-wide microsatellite database of 39,361 loci was generated for the fungus. In experimental screening of 236 loci using four geographically representative strains, 228 (96.6%) were successfully amplified and 214 (90.7%) produced single PCR products. Twenty-three (9.7%) were found to be perfect polymorphic loci. A small-scale population study using 11 polymorphic loci revealed considerable gene diversity. Clustering analysis grouped isolates of this fungus into two clades in accordance with their geographic origins. Thus, the "Seq-Assembly-SSR" approach proved successful for microsatellite discovery.
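
The core idea of searching an assembled sequence rather than raw reads can be illustrated with a minimal Python sketch. It is not the authors' pipeline, whose tools and thresholds are not given in the abstract. The function name `find_ssr_loci`, the motif length range (2 to 6 bp), the minimum of five repeats, and the `min_flank` and `gap_window` parameters are all hypothetical choices made for illustration. The sketch also assumes that assembly gaps appear as runs of `N` in scaffolds, a common convention.

```python
import re

# Assumed definition of a microsatellite for this sketch only:
# a 2-6 bp motif repeated at least 5 times in tandem.
SSR_RE = re.compile(r"([ACGT]{2,6}?)\1{4,}")


def find_ssr_loci(seq, min_flank=50, gap_window=100):
    """Return (start, end, motif, repeat_count) for usable SSR loci.

    A locus is kept only if it has at least `min_flank` bases on both
    sides (room for primer design) and no 'N' within `gap_window` bases
    of either end (not close to an assembly gap). Both thresholds are
    illustrative assumptions, not values from the study.
    """
    seq = seq.upper()
    loci = []
    for match in SSR_RE.finditer(seq):
        motif = match.group(1)
        # Skip homopolymer runs such as AAAA..., which match as "AA" repeats.
        if len(set(motif)) == 1:
            continue
        start, end = match.span()
        # Require enough flanking sequence on both sides.
        if start < min_flank or len(seq) - end < min_flank:
            continue
        # Discard loci whose neighbourhood contains gap characters.
        left = seq[max(0, start - gap_window):start]
        right = seq[end:end + gap_window]
        if "N" in left or "N" in right:
            continue
        loci.append((start, end, motif, (end - start) // len(motif)))
    return loci


if __name__ == "__main__":
    flank = "ACGTTGCA" * 10  # 80 bp of non-repetitive toy sequence
    scaffold = flank + "AG" * 8 + flank
    for locus in find_ssr_loci(scaffold):
        print(locus)
```

On a read of about 146 bp, the flank check alone would reject many loci, whereas a contig several kilobases long usually leaves room on both sides. That is the first constraint the assembly step is meant to relieve. Filtering loci that lie in repetitive regions, the second constraint, would need an additional genome-wide comparison that this sketch does not attempt.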

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Ascomycota / genetics*
  • Databases, Genetic
  • Genetic Markers
  • Genome, Fungal*
  • Microsatellite Repeats / genetics*

Substances

  • Genetic Markers

Grants and funding

Funding was provided by the New Jersey Agricultural Experiment Station, the Rutgers Center for Turfgrass Science, Hatch funds provided by USDA-NIFA, and the USDA-NIFA Specialty Crops Research Initiative Competitive Grant 2009-51181-06028. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.