Computational and experimental characterization of physically clustered simple sequence repeats in plants

Genetics. 2000 Oct;156(2):847-54. doi: 10.1093/genetics/156.2.847.

Abstract

The type and frequency of simple sequence repeats (SSRs) in plant genomes was investigated using the expanding quantity of DNA sequence data deposited in public databases. In Arabidopsis, 306 genomic DNA sequences longer than 10 kb and 36,199 EST sequences were searched for all possible mono- to pentanucleotide repeats. The average frequency of SSRs was one every 6.04 kb in genomic DNA, decreasing to one every 14 kb in ESTs. SSR frequency and type differed between coding, intronic, and intergenic DNA. Similar frequencies were found in other plant species. On the basis of these findings, an approach is proposed and demonstrated for the targeted isolation of single or multiple, physically clustered SSRs linked to any gene that has been mapped using low-copy DNA-based markers. The approach involves sample sequencing a small number of subclones of selected randomly sheared large insert DNA clones (e.g., BACs). It is shown to be both feasible and practicable, given the probability of fortuitously sequencing through an SSR. The approach is demonstrated in barley where sample sequencing 34 subclones of a single BAC selected by hybridization to the Big1 gene revealed three SSRs. These allowed Big1 to be located at the top of barley linkage group 6HS.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Arabidopsis / genetics*
  • Base Sequence
  • Chromosomes, Artificial, Bacterial
  • Cloning, Molecular
  • DNA Primers
  • DNA, Plant / chemistry*
  • DNA, Plant / genetics*
  • Databases as Topic
  • Expressed Sequence Tags
  • Genome, Plant*
  • Hordeum / genetics
  • Plants / genetics*
  • Probability
  • Repetitive Sequences, Nucleic Acid*

Substances

  • DNA Primers
  • DNA, Plant