Development and characterization of genomic and expressed SSRs in citrus by genome-wide analysis

PLoS One. 2013 Oct 28;8(10):e75149. doi: 10.1371/journal.pone.0075149. eCollection 2013.

Abstract

Microsatellites or simple sequence repeats (SSRs) are one of the most popular sources of genetic markers and play a significant role in plant genetics and breeding. In this study, we identified citrus SSRs in the genome of Clementine mandarin and analyzed their frequency and distribution in different genomic regions. A total of 80,708 SSRs were detected in the genome with an overall density of 268 SSRs/Mb. While di-nucleotide repeats were the most frequent microsatellites in genomic DNA sequence, tetra-nucleotides, which had more repeat units than any other SSR types, had the highest cumulative sequence length. We identified 6,834 transcripts as containing 8,989 SSRs in 33,929 Clementine mandarin transcripts, among which, tri-nucleotide motifs (36.0%) were the most common, followed by di-nucleotide (26.9%) and hexa-nucleotide motifs (15.1%). The motif AG (16.7%) was most abundant among these SSRs, while motifs AAG (6.6%), AAT (5.0%), and TAG (2.2%) were most common among tri-nucleotides. Functional categorization of transcripts containing SSRs revealed that 5,879 (86.0%) of such transcripts had homology with known proteins, GO and KEGG annotation revealed that transcripts containing SSRs were those implicated in diverse biological processes in plants, including binding, development, transcription, and protein degradation. When 27 genomic and 78 randomly selected SSRs were tested on Clementine mandarin, 95 SSRs revealed polymorphism. These 95 SSRs were further deployed on 18 genotypes of the three generas of Rutaceae for the genetic diversity assessment, genomic SSRs generally show low transferability in comparison to SSRs developed from expressed sequences. These transcript-markers identified in our study may provide a valuable genetic and genomic tool for further genetic research and varietal development in citrus, such as diversity study, QTL mapping, molecular breeding, comparative mapping and other genetic analyses.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Citrus / classification
  • Citrus / genetics
  • Genome, Plant
  • Genome-Wide Association Study
  • Genomics*
  • Microsatellite Repeats / genetics*
  • Molecular Sequence Annotation
  • Nucleotide Motifs
  • Open Reading Frames
  • Phylogeny
  • Polymorphism, Genetic
  • Species Specificity
  • Transcriptome
  • Untranslated Regions

Substances

  • Untranslated Regions

Grants and funding

This research was supported financially by the Fundamental Research Funds for the Central Universities (2013PY083) and the National Natural Science Foundation of China (grant nos. 31071777, 31130046, and 31101528). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.